Overcoming Data Imbalance Problems in Sexual Harassment Classification with SMOTE
نویسندگان
چکیده
Delivery of justice with the help artificial intelligence is a current research interest. Machine learning natural language processing (NLP) can classify types sexual harassment experiences into quid pro quo (QPQ) and hostile work environments (HWE). However, imbalanced data are often present in classes classification on specific datasets. Data imbalance cause decrease classifier's performance because it usually tends to choose majority class. This study proposes implementation evaluation synthetic minority over-sampling technique (SMOTE) improve QPQ HWE classifications experience dataset. The term frequency-inverse document frequency (TF-IDF) method applies weighting process. Then, we compare naïve Bayes K-Nearest Neighbor (KNN) classifying experiences. comparison shows that classifier superior KNN HWE, AUC values 0.95 versus 0.92, respectively. results show by applying SMOTE classifier, precision class increase from 74% 90%.
منابع مشابه
Sexual harassment.
Sexual harassment (SH) is a continuing, chronic occupational health problem in organizations and work environments. First addressed in the Journal of Occupational Health Psychology through a 1998 Special Section on Sexual Harassment, we return to this consequential issue. If the goal is to reduce SH in organizations, and we believe that it should be, then a key question is whether we have made ...
متن کاملWhat's Wrong With Sexual Harassment?
In this article, Professor Franke asks and answers a seemingly simple question: why is sexual harassment a form of sex discrimination under Title VII of the Civil Rights Act of 1964? She argues that the link between sexual harassment and sex discrimination has been undertheorized b9 the Supreme Court. In the absence of a principled theory of the wrong of sexual harassment, Professor Franke argu...
متن کامل36140-Sexual Harassment_54789-Sexual Harassment
e x u a l h a r a s s m e n t o f e i t h e r e m p l o y e e s o r s t u d e n t s i s a v i o l a t i o n o f f e d e r a l a n d s t a t e l a w s . I t i s t h e p o l i c y o f t h e U n i v e r s i t y o f M a i n e S y s t e m t h a t n o m e m b e r o f t h e U n i v e r s i t y S y s t e m c o m m u n i t y m a y s e x u a l l y h a r a s s a n o t h e r...
متن کاملFramework for Prioritizing Solutions in Overcoming Data Quality Problems Using Analytic Hierarchy Process (AHP)
The Central Statistics Agency (BPS) is a government institution that has the authority to carry out statistical activities in the form of censuses and surveys, to produce statistical data needed by the government, the private sector and the general public, as a reference in planning, monitoring, and evaluation of development results. Therefore, providing quality statistical data is very decisiv...
متن کامل36140-Sexual Harassment_54789-Sexual Harassment
e x u a l h a r a s s m e n t o f e i t h e r e m p l o y e e s o r s t u d e n t s i s a v i o l a t i o n o f f e d e r a l a n d s t a t e l a w s . I t i s t h e p o l i c y o f t h e U n i v e r s i t y o f M a i n e S y s t e m t h a t n o m e m b e r o f t h e U n i v e r s i t y S y s t e m c o m m u n i t y m a y s e x u a l l y h a r a s s a n o t h e r...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International journal on information and communication technology
سال: 2022
ISSN: ['2356-5462']
DOI: https://doi.org/10.21108/ijoict.v8i1.622